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SUMMARY  OF  RESEARCH 
AFOSR-86-0189 


This  report  describes  the  results  of  the  work  on  optical  symbolic  computing  that  was 
performed  on  grant  AFOSR-86-0189  during  a  three  year  program.  The  detail  of  the 
research  is  in  the  appended  papers.  This  section  of  the  report  is  to  summarize  the 
motivation  for  the  work,  the  reasons  for  the  approach  taken,  and  the  results.  First,  a 
rationale  is  given  for  the  use  of  optics  in  A.I.,  and  the  approach  taken  is  described.  The 
results  of  the  research  are  then  given. 

MOTIVATION 

Several  areas  of  artificial  intelligence,  notably  knowledge-based  systems,  pattern 
recognition  and  neural  networks  require  so  much  computer  power  that  their  usefulness  as 
problem-solving  techniques  are  limited  by  present  computer  hardware.  Our  ability  to 
address  the  computing  power  limitations  of  conventional  processors  is  limited  by  electronic 
problems  in  the  distribution  of  data  and  control  signals.  These  problems  are  not  present  in 
optical  computing  systems.  However,  conventional  serial  approaches  to  A.I.  problems  do 
not  map  gracefully  onto  optical  processors.  A  new  approach  is  needed. 

Our  work  was  to  develop  an  optical  data  representation;  design,  simulate  and  implement 
basic  optical  operations;  design,  simulate  and  implement  optical  storage  and  data  handling 
modules;  design,  simulate  and  implement  data-dependent  control;  and  simulate  and 
implement  a  prototype  optical  inference  engine.  We  have  developed  the  notation, 
algorithm,  and  logic  for  optical  resolution;  designed  the  resolution  system  using  medium 
scale  optical  building  blocks;  generated  optical  and  functional  simulations;  designed, 
simulated  and  built  prototype  components;  and  performed  a  functional  simulation  of  the 
system. 

RATIQNALkEQR  OPTICS. 

This  project  investigated  the  use  of  optics  in  implementing  a  particular  approach  to 
knowledge-based  systems  -  the  approach  known  as  mathematical  resolution.  This 
combination  of  method  and  technology  must  compete  with  other  technologies,  for  example 
VLSI  and  with  other  methods,  such  as  back  chaining,  (i.e.  PROLOG),  and  production 
rules.  Our  purpose  here  is  to  compare  these  approaches  and  show  why  mathematical 
resolution  using  optics  is  a  viable  and  attractive  approach. 

The  underlying  thread  in  all  knowledge- based  systems  is  that  they  contain  a  large  amount 
of  unstructured  information.  By  "unstructured",  we  mean  that  the  relationship  between 
different  pieces  of  information  is  not  strongly  hierarchal.  Thus  the  process  of  computation 
involves  a  relatively  unselective  examination  of  combinations  of  data  to  find  sequences  of 
information  within  the  data  which  satisfy  the  requirements  of  logic  and  solve  the  problem  at 
hand. 

If  the  data  space  is  large,  many  subsequences  of  information  exist  within  the  system  which 
can  be  independently  discovered.  This  offers  the  possibility  of  increasing  the  speed  of  a 
knowledge-based  system  by  the  use  of  parallelism,  either  electronic  or  optical.  However,, 
the  search  for  subsequences  must  have  overall  coordination  so  that  all  meaningful 
possibilities  are  examined,  but  that  the  same  possibilities  are  not  redundantly  examined. 


There  is  a  tradeoff  between  the  size  of  the  data  space  and  the  complexity  of  the  algorithms 
required  to  manipulate  it.  In  the  simplest  notation,  the  propositional  calculus,  each  entity  in 
the  problem  space  must  be  separately  described  in  the  data.  For  example,  "IF  FELIX  IS  A 
CAT,  THEN  FELIX  HAS  WHISKERS."  If  there  were  many  cats  in  the  problem,  each 
would  require  its  own  statement  about  whiskers.  In  a  more  complex  notation,  the  first 
order  propositional  calculus,  more  general  statements  may  be  made.  For  the  example 
given,  "IF  x  IS  A  CAT,  THEN  x  HAS  WHISKERS."  Hus  reduces  the  size  of  the  initial 
data  space,  but  requires  considerably  more  complex  processing  to  determine  the 
applicability  of  a  statement  to  a  given  situation.  If  the  number  of  entities  (e.g.  cats)  in  a 
problem  is  small,  the  increase  in  data  space  size  may  be  compensated  for  by  the  reduction 
in  the  complexity  of  processing,  and  by  increased  opportunities  for  parallelism. 

Even  with  the  data  notation  chosen,  there  remain  other  significant  choices  about  how  to 
process  the  data.  The  three  primary  methods  are  production  rules  (forward  chaining),  such 
as  used  in  Mycin,  backchaining,  and  mathematical  resolution.  Backchaining  is  the  method 
used  by  the  programming  language  PROLOG,  and  resolution  is  typically  used  for  proving 
theorems  in  mathematical  axiom  systems.  Each  has  advantages  and  disadvantages. 

A  major  advantage  of  forward  chaining  is  that  it  is  relatively  easy  to  incorporate 
probabilities  into  the  data.  Thus,  if  a  given  situation  could  have  multiple  causes,  the 
relative  likelihood  of  the  alternatives  can  be  computed.  In  order  for  this  to  be  practical, 
however,  the  depth  of  the  inference  process  and  the  number  of  alternatives  must  be  small, 
otherwise  the  computation  gets  out  of  hand.  This  is  the  primary  disadvantage  of  forward 
chaining.  The  process  is  not  strongly  goal-directed,  and  in  a  complex  situation  will  (given 
sufficient  time)  deduce  everything  about  the  situation,  in  an  upredictable  order.  If  the 
user's  interest  is  focussed  on  only  one  facet  of  the  problem,  there  is  no  way  to  obtain 
directly  information  about  only  that  facet 

Backchaining  starts  with  a  completely  or  partially  specified  conclusion,  and  reasons 
backwards  to  check  its  compatibility  with  the  data.  It  can  be  used  to  answer  questions  by 
using  conclusions  from  the  first  order  predicate  calculus.  For  example,  the  conclusions  "x 
HAS  WHISKERS"  would  result  in  the  substitution  ”x=FELIX"  if  the  fact  "FELIX  IS  A 
CAT"  were  added  to  the  statement  in  the  earlier  illustration.  In  most  cases,  backchaining  is 
faster  than  production  rules  when  the  desired  outcome  can  be  partially  specified,  because 
the  processing  is  more  narrowly  focussed.  More  general  questions,  such  as  "tell  me  the 
significance  of  the  present  situation",  cannot  be  answered  by  backchaining  systems. 

Both  backward  and  forward  (production)  systems  have  a  strong  directional  basis.  The 
rules  only  work  one  way.  But  from  a  logical  point  of  view,  the  rules  are  actually 
bidirectional.  The  rule  "IF  x  IS  A  CAT,  THEN  x  HAS  WHISKERS"  also  implies  that  if  x 
does  not  have  whiskers,  it  is  not  a  cat.  Mathematical  resolution  is  a  relative  of 
backchaining  which  uses  a  different  notation  (clause  form)  which  captures  this 
bidirectionality.  In  addition,  clause  form  notation  is  simpler  from  an  implementation 
standpoint  than  the  more  general  formula  used  in  forward  and  back  chaining.  Normally, 
mathematical  resolution  is  used  for  proof  by  contradiction.  In  this  form,  a  hypothetical 
conclusion  is  negated,  and  then  the  data  space  consisting  of  the  negated  hypothesis  and  and 
the  known  facts  is  examined  for  inconsistency.  If  one  exists,  the  original  hypothesis  must 
have  been  true.  Resolution  can  also  be  used  in  the  forward  direction  to  merely  generate 
more  facts  from  an  initial  data  space.  In  this  mode,  it  is  as  undirected  as  production  rules  in 
forward  systems. 


Because  of  its  generality  and  simplicity  of  its  implementation,  we  have  chosen  to  study  the 
implementation  of  mathematical  resolution  as  a  parallel  inference  mechanism.  In  order  to 
keep  system  complexity  under  control  at  this  stage  of  the  research,  we  have  restricted  our 
efforts  to  the  propositional  calculus. 

Mathematical  resolution  has  a  large  amount  of  inherent  parallelism,  limited  only  by  the  size 
of  the  data  space.  Two  principal  parallel  operations  are  resolvent  generation  and  duplicate 
detection.  Resolvent  generation  is  the  process  of  combining  existing  clauses  to  form  new 
ones.  This  process  can  be  carried  out  independently  and  in  parallel  for  every  pair  of 
clauses  in  the  data  base.  Of  course,  this  requires  an  adequate  number  of  processing 
elements  and  parallel  access  to  the  data.  Once  new  clauses  are  generated,  it  is  necessary  to 
determine  whether  or  not  they  already  exist  in  the  data  base.  Failure  to  delete  duplicate 
clauses  not  only  causes  the  data  space  to  grow  at  an  unnecessarily  high  rate,  but  it  also  may 
cause  the  computation  to  be  nonterminating.  The  search  for  existing  clauses  which  match  a 
new  one  may  be  done  in  parallel  by  "broadcasting"  the  new  clause  into  the  data  base  and 
waiting  for  an  appropriate  response.  Again,  parallel  access  to  the  data  and  a  sufficient 
number  of  computing  elements  are  required. 

The  computing  operations  required  for  resolution  are  relatively  simple  boolean  algebra  and 
a  small  amount  of  counting.  The  necessary  logic  could  be  fabricated  in  either  VLSI  or 
planar  integrated  optics.  Certainly  the  present  state  of  the  art  in  fabrications  favors  VLSI. 
However,  the  parallel  data  access  requirements  of  the  problem  cannot  be  met  in  VLSI.  A 
system  performing  parallel  logic  on  a  1000  x  1000  grid  would  require  1,000,000  bits  of 
information  per  clock  cycle  arriving  in  parallel.  Pinout  and  crosstalk  limitations  in  VLSI 
would  require  this  information  to  be  partially  serialized  for  loading.  Even  if  the  VLSI  were 
faster  than  the  equivalent  optics  (and  there  is  reason  to  believe  that  the  speed  of  optics  can 
be  made  comparable),  the  time  required  for  serial  data  handling  would  limit  the  speed  of  a 
VLSI  implementation. 

RESEARCH  METHOD 

The  approach  was  to  first  develop  an  optical  data  representation  for  knowledge  and  then  to 
construct  simulation  software  for  all  optical  components  to  validate  system  behavior  and 
reduce  implementation  risk.  The  next  step  was  to  design,  simulate,  and  implement  optical 
computation  modules  for  the  basic  operations  required  for  inference;  optical  storage  and 
data  handling  modules  required  for  data-dependent  computation;  and  the  data-dependent 
control  required  for  inference.  The  ultimate  goal  was  to  simulate  and  implement  a 
prototype  optical  inference  engine. 

As  was  discussed  in  the  Rationale  for  Optics,  the  A.I.  technique  chosen  was  reasoning  by 
resolution.  In  reasoning  by  resolution,  one  must  begin  with  a  set  of  information  about  the 
problem.  Then,  one  takes  the  fact  of  interest  and  states  its  opposite.  If  the  fact  to  be 
proven  is  true,  the  total  set  including  the  negation  now  contradicts  itself.  The  object  of 
resolution  is  to  make  this  contradiction  explicit  The  technique  used  is  to  convert  each  fact 
to  a  binary  bit  string  (a  clause)  encoding  the  data  and  systematically  generate  new  clauses 
by  combining  existing  clauses.  Then,  new  clauses  are  retained  which  contain  exactly  one 
item  of  information  which  was  true  in  one  parent  and  false  in  the  other.  This  item  is 
omitted  in  the  combination  and  all  other  new  clauses  are  discarded.  The  remaining  new 
clauses  are  checked  against  all  existing  clauses  and  duplicates  discarded.  New  clauses  are 
generated  until  either  an  empty  clause  is  generated  or  no  new  clauses  can  be  formed.  If  the 
empty  clause  is  produced,  the  original  fact  was  true,  otherwise  it  was  false. 


The  optical  inference  system  was  designed  around  medium  scale  optical  components. 

Major  components  studied  and  developed  were: 

•  Parallel  read  -  parallel  write  memory  storage  frames  which  allow  parallel 
access  to  entire  matrices  of  stored  data. 

•  Switchable  prism  multiplexors  and  shifters  which  allow  spatial  switching  of 
bit  vector  and  matrix  data. 

•  Optical  pushdown  stacks  which  are  a  combination  of  the  memory  frames 
and  the  prism  shifters.  These  allow  vector-at-a-time  access  to  data 
matrices. 

•  Resolver  (clause  combiner)  using  medium  scale  boolean  logic  to  produce  a 
matrix  of  clauses  from  an  old  matrix  and  a  new  clause. 

•  Duplicate  detector  to  compare  a  new  clause  against  the  existing  clause  base 
in  parallel  to  rapidly  reject  duplicates. 


RESULTS 

A  system  was  designed  and  simulated.  Several  components  were  constructed  and  tested. 
Photo  addressed  spatial  light  modulators  (SLM)  that  are  suitable  for  use  as  memory  planes 
were  designed  and  built.  The  work  is  continuing  on  an  unfunded  basis,  and  the 
demonstration  of  the  use  of  the  photo  addressed  SLM  is  being  done  as  a  master’s  thesis 
project  A  pixilated  SLM  was  designed  and  an  initial  device  fabricated.  That  research  is 
being  continued  at  a  low  level  using  other  funds.  The  unpixilated  photo  addressed  device 
has  better  than  100:1  contrast  ratio,  70  lines/mm  resolution,  and  a  strong  5  KHz  frame  rate. 

Data  representations  were  selected  and  analyzed,  the  system  was  designed  and  functional 
simulation  was  performed.  Phenomenological  simulations  for  the  key  elements  of  the 
system  were  performed.  Two  versions  of  a  prototype  gate  have  been  built  and  tested  and  a 
Proof-of-Concept  2:1  multiplexor  was  built  and  tested.  A  dual  bit  photoaddressed  FLC 
memory  element  has  been  built  and  tested. 

The  system  was  functionally  simulated,  varying  several  system  design  parameters.  In 
approximate  order  of  importance,  these  parameters  are: 

1 )  Parallel  versus  serial  frame  compaction.  In  parallel  compaction,  valid 
resolvents  are  immediately  squeezed  to  the  top  of  the  frame.  In  serial 
compaction,  the  whole  frame  must  be  shifted  and  checked.  This  has  a 
major  impact  on  performance. 

2)  Generation  storage  strategy.  In  separated  storage,  data  generated  in  each 
pass  through  the  system  is  kept  separate  from  previous  data.  This  takes  no 
more  storage,  but  reduces  computing.  In  compacted  checking,  frames  are 
added  to  until  full. 

3)  Extent  of  duplicate  checking.  In  complete  checking,  duplicates  within  the 
same  pass  (generation)  are  detected.  In  partial  checking,  only  older 
duplicates  are  found. 


4) 


Extent  of  duplicate  checking.  In  parallel  checking,  the  entire  data  store  is 
checked  in  one  operation.  In  serial  checking,  one  frame  of  the  data  store  is 
checked  at  a  time.  This  affects  the  performance  if  the  faster  options  of  the 
preceding  choices  are  selected. 

Table  1,  gives  the  result  of  that  simulation  The  particular  problem  chosen  is  given  in 
Appendix  I. 

The  data  representation  chosen  was  the  positional  dibit  notation  for  clauses  (the  basic  unit 
of  knowledge  in  mathematical  resolution).  [1986  SPIE  Conference  on  Optical  Computing] 
This  is  illustrated  in  Figure  1. 

The  fundamental  data  element  in  a  clause  is  the  literal,  which  can  occur  in  asserted  form  (X) 
or  negated  form  (X*),  or  simply  be  omitted.  Each  row  in  the  notation  corresponds  to  a 
clause.  Each  pair  of  columns  in  the  notation  corresponds  to  a  literal,  with  the  bit  pair  01 
used  for  assertion,  10  used  for  negation,  and  00  for  omission.  Each  bit  is  encoded  in 
polarized  light  as  shown  in  Figure  1. 


problem  size:  15  rules,  19  clauses,  31  literals 
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GENERATION  STORAGE  STRATEGY 

TABLE  1  SIMULATION  RESULTS 


The  data  handling  and  storage  used  a  functional  and  data  flow  design  of  a  data  compaction 
system  and  was  described  in  Applied  Optics,  May  1987.  A  review  of  plausible  optical  data 
storage  techniques  was  given  at  the  1987  Computers  in  Aerospace  Conference.  Because 
data-dependent  control  is  needed,  a  review  of  control  techniques  for  mathematical 
resolution  was  performed.  The  set-of-support  strategy  was  selected  as  offering  the  best 
balance  between  efficiency  and  complexity  of  optical  implementation. 

The  optical  resolution  system  is  shown  in  Figure  2.  In  this  figure,  clauses  are  stored  in  the 
"old  generation"  push  down  stack.  Frame-at-a-time  multiplexors  are  used  to  route  current 
information  to  the  resolvent  formation  logic,  which  systematically  combines  a  single  clause 
with  an  existing  frame  to  produce  a  frame  of  potential  new  resolvents.  These  are  checked 
for  validity  and  for  duplication  of  existing  data,  and  useful  new  resolvents  are  placed  in  the 
new  generation.  As  resolution  proceeds,  new  generation  data  is  added  to  the  old 
generations  to  allow  the  process  to  proceed.  Figure  3  shows  the  operation  of  duplicate 
detection. 

Ferroelectric  liquid  crystals  are  key  to  many  of  the  components  of  the  system.  In  one 
application,  the  fast  low-loss  polarization  rotation  characteristics  are  used  and  in  others, 
their  potential  as  a  high-speed  switch  is  employed.  Figure  4  illustrates  both  of  these  uses. 

Figure  5  shows  how  the  switching  of  the  index  of  refraction  can  be  employed  to  fabricate  a 
position  shifter.  This  concept  has  been  demonstrated  and  a  shifter  was  fabricated.  A 
shifter  of  this  sort  could  be  used  to  effect  compaction  of  sparse  matrices  into  dense  matrices 
as  shown  in  Figure  6. 

A  logic  gate  has  been  designed  that  uses  polarization  logic.  It  is  illustrated  in  Figure  7, 
where  the  possible  inputs  are  the  four  combinations  of  polarized  light  shown  at  station  "A". 
The  outputs  at  station  "D"  implement  the  "or"  or  "and"  boolean  logic  function  depending  on 
whether  logic  1  is  chosen  as  horizontal  or  vertical  polarization. 

A  simulation  and  an  experiment  were  performed  as  shown  in  Figure  8.  Robust  operation 
demonstrated. 

A  parallel  storage  device  with  photoaddressed  spatial  light  modulators  (SLMs)  has  been 
designed  and  construction  of  the  photoaddressed  SLM  has  been  completed.  This  device 
has  been  shown  to  have  better  than  100.T  contrast  ratio,  50  lines/mm  and  5  KHz  frame 
rates.  The  storage  device  is  shown  in  Figure  9. 


FIGURE  2,  OPTICAL  RESOLUTION  SYSTEM 


FIGURE  3,  DUPLICATE  DETECTION 


FIGURE  4,  FERROELECTRIC  LIQUID  CRYSTALS 


FIGURE  6,  COMPACTION  LOGIC 


FIGURE  7,  POLARIZED  LIGHT  LOGIC  GATE 


FIGURE  8,  PROTOTYPE  LOGIC  GATE 


APPENDIX] 

A  REASONING  PROBLEM 

(From  Winston,  Artificial  Intelligence,  p.  177) 

1  )  If  the  animal  has  hair  then  it  is  a  mammal. 

2  )  If  the  animal  gives  milk  then  it  is  a  mammal. 

3  )  If  the  animal  has  feathers  then  it  is  a  bird. 

4  )  If  the  animal  flies  and  it  lays  eggs  then  it  is  a  bird. 

5  )  If  the  animal  is  a  mammal  and  it  eats  meat  then  it  is  a  carnivore. 

6  )  If  the  animal  is  a  mammal  and  it  has  pointed  teeth  and  it  has  claws  and  its  eyes  point 

forward  then  it  is  a  carnivore. 

7 )  If  tiie  animal  is  a  mammal  and  it  has  hoofs  then  it  is  an  ungulate. 

8  )  If  the  animal  is  a  mammal  and  it  chews  cud  then  it  is  an  ungulate  and  it  is  even-toed. 

9  )  If  the  animal  is  a  carnivore  and  it  has  a  tawny  color  and  it  has  dark  spots  then  it  is 

a  cheetah. 

10)  If  the  animal  is  a  carnivore  and  it  has  a  tawny  color  and  it  has  black  stripes  then  it  is 
a  tiger. 

11)  If  the  animal  is  an  ungulate  and  it  has  long  legs  and  it  has  a  long  neck  and  it  has  a 
tawny  color  and  it  has  dark  spots  then  it  is  a  giraffe. 

12)  If  the  animal  is  an  ungulate  and  it  has  a  white  color  and  it  has  black  stripes  then  it  is 
a  zebra. 

13)  If  the  animal  is  a  bird  and  it  does  not  fly  and  it  has  long  legs  and  it  has  a  long  neck  and 
it  is  black  and  white  then  it  is  an  ostrich. 

14)  If  the  animal  is  a  bird  and  it  does  not  fly  and  it  swims  and  it  is  black  and  white  then  it 
is  a  penguin. 

15)  If  the  animal  is  a  bird  and  it  Hies  then  it  is  an  albatross. 

PROVE: 

o  If  the  animal  has  hair  and  it  has  hoofs  and  its  color  is  white  and  it  has  black  stripes 
then  it  is  a  zebra. 

o  If  the  animal  has  a  tawny  color  and  it  has  dark  spots  and  it  gives  milk  and  it  chews 
cud  and  it  has  a  long  neck  and  it  has  long  legs,  then  it  is  a  giraffe. 


These  rules  are  translated  into  a  Lisp  input  as: 


(hairy  implies  mammal) 

(lactates  implies  mammal) 

(feathers  implies  bird) 

((flies  and  (lays  eggs))  implies  bird) 

((mammal  and  (eats  meat))  implies  carnivore) 

((mammal  and  (pointed  teeth)  and  claws  and  (looks  forward))  implies  carnivore) 
((mammal  and  hoofs)  implies  ungulate) 

((mammal  and  (chews  cud))  implies  (ungulate  and  (even  toed)) 

((carnivore  and  tawny  and  (dark  spots))  implies  cheetah) 

((carnivore  and  tawny  and  (black  stripes))  implies  tiger) 

((ungulate  and  (long  legs)  and  (long  neck)  and  tawny  and  (dark  spots))  implies  giraffe) 
((Ungulate  and  white  and  (black  stripes))  implies  zebra) 

((bird  and  (not  flies)  and  (long  legs)  and  (long  neck)  and  (black  and  white))  implies 
ostrich) 

((bird  and  (not  flies)  and  swims  and  (black  and  white))  implies  penguin) 

((bird  and  flies)  implies  albatross) 

* 

((hairy  and  hoofs  and  white  and  (black  stripes))  implies  zebra 

In  clause  form  this  becomes: 

1*  +  2 

3*  +  2 

4'  +  5 

6’  +  7*  +  5 
2*  +  8*  +  9 

2*  +  10*  +  11*  +  12*  +  9 

2*  +  13*  +  14 

2*  +  15*  +  14 

2*  +  15*  +  16 

9*  +  17*  +  18'  +  19 

9*  +  17*  +  20*  +  21 

14*  +  22*  +  23’  +  17*  +  18*  +  24 

14*  +  25*  +  20’  +  26 

5*  +  6  +  22*  ♦  23*  +  27*  +  28 

5*  +  6  +  29*  +  27*  +  30 

5*  +  6’  +  31 
• 

1 

13 

25 

20 

26* 
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